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Sieves,  Signal  Extraction,  and  Design  (Unclassified) 

1.  Summary :  See  DD  1473,  item  19. 

2,3.  Research  objectives,  status  of  work:  In  the  description  that 
follows,  citations  indicated  by  [lj,  [2],  etc.,  refer  to  the  list 
given  in  item  (4)  below. 

a .  Sieve  estimation  for  Gaussian  processes . 

Let  X  =  (Xt,  t  e  T}  be  a  Gaussian  process  with  mean  function  m 

and  covariance  function  R.  The  PI  has  developed  sieve  estimators  m 

and  R  to  estimate  m  when  R  is  known,  and  R  when  m  is  known.  The 
construction  and  properties  of  these  estimators  allow  T  to  be  an 
arbitrary  set.  Thus  this  method  has  very  wide  applicability. 

In  the  case  of  estimating  m,  the  only  special  assumption  needed 
is  that  m  belong  to  a  known  countably-dimensioned  subspace  of  the 
reproducing  kernel  Hilbert  space  ( RKHS )  K(R).  For  the  estimation  of 
R  when  m  is  known,  an  additional  assumption  is  needed  concerning  the 
bivariate  expansion  of  R.  In  either  case  the  data  consists  of  n 
replicates  (trajectories)  of  the  process.  A  discussion  of  this  as  a 
design  constraint  is  given  in  [7],  an  extended  version  of  [4]. 

In  both  cases,  the  estimators  are  strongly  consistent  in  a 
natural  norm  as  long  as  a  sieve  parameter  d  (roughly,  the  number  of 
terms  in  the  estimator)  grows  as  o(n),  and  are  consistent  in 
probability  and  in  mean  square  locally  if  d  grows  as  0(n).  They  are 
also  asymptotically  unbiased  as  d  goes  to  infinity.  Details  are 
given  in  papers  [2]  and  [41.  Paper  [1]  shows  how  the  two  estimators 
can  be  combined,  and  applies  the  method  in  the  Banach  space  setting. 

There  is  a  good  deal  of  latitude  in  the  construction  of  these 
estimators,  depending  as  they  do  on  an  arbitrary  choice  of  "basis" 
functions  in  an  appropriate  Hilbert  space.  Using  this  and  elementary 
tools  (essentially  Gram-Schmidt  orthogonal ization ) ,  it  is  easy  to 
construct  an  estimator  of  the  mean  which  is  computationally  simple 
and  which  requires  only  d  time  "sites"  t ^ ,  ...,  t^  for  observing  the 

process.  Computer  simulations  (still  underway)  show  that  this  method 
allows  one  to  recover  significant  features  of  the  underlying  mean 
function  (the  "trend"  or  "signal" )  easily.  (Theory  requires  that  a 
be  limited  by  n,  the  number  of  replicates  observed;  greater  accuracy 
can  be  attained  by  using  more  replicates.  The  simulation  study  so 

g 

far  indicates  that  d  should  oe  about  n'  . ) 

b .  On  a  zero-one  law  of  Driscoll,  and  optimal  signal 
extraction . 

We  are  given  an  observable  signal  Yt  -  S_  +  N^,  t  e  T,  where  S 

and  N  are  independent  zero-mean  Gaussian  processes  (signal  and  noise) 
with  known  covariances.  Under  certain  conditions,  Driscoll  [1975j 
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has  shown  that  an  optimal  method  to  extract  the  signal  S  from  X  is  to 
use  the  process  S  defined  by  S„  =  ,  where  is  the 

sigma-algebra  generated  by  (Xfc,  t  e  T}.  These  conditions  concern  the 
topology  of  T  and  the  continuity  of  the  covariances . 

The  PI  has  generalized  this  framework  by  letting  T  be  an 
arbitrary  set  and  by  not  assuming  any  particular  relationship  between 
X  and  S,  so  that  the  stochastic  signal  can  be  corrupted  by  noise  in  a 
more  arbitrary  fashion.  Optinality  can  still  be  defined  as  in 

Driscoll  by  requiring  that  the  sample  paths  S  and  S  belong  to  a 

RKHS  H{ R)  for  and  appropriate  positive  definite  kernel  R.  Closeness 

is  now  defined  by  IIS  -S  II,  using  the  norm  of  H(R).  To  do  this,  we 

take  R  to  dominate  the  covariance  X  of  S  in  the  sense  that 

»(K)  c  H( R) ;  we  denote  this  by  K  <<  R.  If  R  n-dominates  K  (Fortet 

n 

[1974]),  we  write  K  <<  R.  The  plan  is  to  show: 

n 

i.  If  K  «  R  then  P(S  e*(R))  =  0  or  1 ,  and  is  1  iff  K  <<  R; 


ii.  if  K  «  R  then  P(S.  e  H(R))  *  1;  and 

iii.  EllS  -S  il2  <  ENS. -nil2  a.s.  for  ail  h  €  #(R);  and  to  give  tr.e 
value  for  the  left-hand  side  of  (*). 

Again,  in  all  cases,  no  assumptions  are  made  on  T. 

The  "zero"  part  of  item  (i)  has  been  proved  in  full  generality, 
while  the  "one"  part  and  item  (ii)  have  been  proved  under  a 
separability  assumption  (see  the  AFOSR  technical  report  Beder  [1986.  ' 
that  may  not  be  removable.  The  value  of  the  left-hand  side  of  (*)  /[( 

has  been  derived  as  the  trace  of  an  operator  (3eaer  [1986]),  but  the  j  V 
inequality  remains  to  be  proved.  V  v 

This  work  also  has  application  to  certain  stochastic  design  — 

problems;  see  Ylvisaker  [1987].  — 

c .  Experimental  Design . 

The  PI  has  given  c  definitive  analysis  [5]  of  confounding 
patterns  in  two-factor  experiments  in  which  one  factor  has  a  prime 
number  of  levels.  The  existence  of  a  generalized  Hadamara  matrix 
implies  the  existence  of  an  orthogonal  decomposition  of  interaction 
of  a  certain  type.  Moreover,  the  converse  holds  at  least  in  the  case 
that  the  prime  (number  of  levels)  is  2  or  3.  When  the  prime  is  2, 
then,  the  problem  of  confounding  is  equivalent  to  the  classical 
Hadaraard  matrix  conjecture.  The  method  used  makes  no  assumption  a 
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priori  about  the  algebraic  or  geometric  structure  of  the  set  of 
levels  attached  to  each  factor  in  the  experiment.  Background  to  the 
method  is  given  in  [8]. 

It  is  of  particular  interest  that  the  problem  of  confounding  in 
the  simplest  two-factor  asymmetric  design  is  equivalent  to  a 
difficult  combinatorial  conjecture,  and  thus  far  deeper  than  the 
problem  in  classical  symmetric  factorials.  This  depth  is  not 
revealed  by  standard  methods  of  analyzing  confounding. 

The  classical  Hadamard  matrix  problem  may  be  viewed  as  that  of 
const* acting  a  maximal  set  of  nxi  vectors  consisting  of  i's  and  -1's. 
Such  a  maximal  set  can  of  course  have  no  more  than  n  elements.  It  is 
known  that  if  a  maximal  set  of  size  n  exists,  then  n  =  2  or  n  is  a 
multiple  of  4;  the  conjecture  is  the  converse,  namely  that  if  n  is  a 
multiple  of  4  then  there  exists  a  maximal  set  of  size  n.  The  ?I  has 
shown  that  if  n  =  2  (mod  4)  then  all  maximal  sets  are  of  size  2. 

Also,  a  computer  search  routine  written  by  the  PI  indicates  that  if  n 
is  a  multiple  of  4  then  the  size  of  any  maximal  set  is  also  a 
multiple  of  4.  The  empirical  evidence  from  this  search  is  for  n  =  4, 
8,  ...,  24,  values  for  which  the  conjecture  is  known.  These  results 

involve  maximal  orthogonal  sets  of  possibly  non-maximum  cardinality, 
a  new  approach.  Whether  this  will  shed  further  light  on  the 
conjecture  remains  to  be  seen. 

d .  Miscellaneous . 

Paper  [3]  and  the  technical  report  [6]  were  also  completed 
under  this  grant. 
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( iv)  "Sieve  estimation  for  Gaussian  processes,"  Colloquium, 
Dept,  of  Mathematics,  University  of  California,  Irvine, 
June  10,  1986. 


5 


(v)  "On  the  zero-one  laws  of  Xallianpur  and  Driscoll," 
Colloquium,  Dept,  of  Mathematics,  U.C.  Irvine,  March  4, 
1987. 

(vi)  "Sieve  estimation  for  Gaussian  processes,”  Colloquium, 
Div.  of  Statistics,  U.C.  Davis,  March  5,  1987. 

(vii)  "On  the  zero-one  laws  of  Kallianpur  and  Driscoll,"  IMS 
Regional  Meeting,  Dallas,  TX,  March  24,  1987.  (IMS  Ads. 
199-17) . 

( viii )" Inference  for  Gaussian  processes,"  Dept,  of 

Statistics,  University  of  Chicago,  May  16,  1988. 

(ix)  "A  sieve  estimator  for  the  mean  of  a  Gaussian  process," 

International  Symposium  on  Forecasting,  Amsterdam,  June 
13,  1988. 

(x)  "Confounding  and  the  Hadamard  matrix  problem," 

Centennial  Meeting  of  Amer .  Math.  Soc . ,  Providence,  HI, 
Aug.  9,  1988  (AMS  Abs .  344-62-113). 


b.  Consultative  and  advisory  functions:  Joint  research  with  A. 
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7.  New  discoveries,  inventions,  patents .-  none. 


